🎮 Reinforcement Learning - recaip · Scour

Found-RL: foundation model-enhanced reinforcement learning for autonomous driving

arxiv.org·12h

Show HN: Fighting the War Against Expensive Reinforcement Learning

cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app·10h·

Discuss: Hacker News

Blockwise Advantage Estimation for Multi-Objective RL with Verifiable Rewards

arxiv.org·12h

check out this article on Reinforcement Learning with R: Origins, Real-Life Applications, and Practical Implementation

dev.to·2d·

Discuss: DEV

A multi-agent reinforcement learning approach to autonomous aircraft taxiing with taxiing time, fuel consumption, and emission optimization

sciencedirect.com·1d

Optimizing post-disaster road restoration with reinforcement learning: A traveler-behavior-aware approach

sciencedirect.com·1h

🏕️Survivalism

A training principle for drifting models

breno.bearblog.dev·6h

🤖Machine Learning

Observe emergent behavior in autonomous multi-agent LLM networks

agents.glide2.app·2d·

Discuss: Hacker News

Robotics Motion Learning: Training Linked Robot Arms with Kuramoto Models

hackernoon.com·1d

Multi AI Agent Systems with crewAI

deeplearning.ai·6h

YORU: Animal behavior detection with object-based approach for real-time closed-loop feedback

science.org·1d

Repo Optimizer: I Let a KISS AI Agent Optimize Itself Overnight. It Cut Its Own Cost by 98%.

dev.to·2h·

Discuss: DEV

How to Leverage Explainable AI for Better Business Decisions

towardsdatascience.com·2h

A Conceptual Framework for Exploration Hacking

lesswrong.com·1h

GLM-5: From Vibe Coding to Agentic Engineering

simonwillison.net·22h·

Discuss: Hacker News

Feedback Control for Computer Systems

janert.org·10h

The 4 Mixture of Experts Architectures: How to Train 100B Models at 10B Cost

pub.towardsai.net

·4h

JupyterPS/VBAF: Visual Business Automation Framework - PowerShell-based reinforcement learning for education and business automation

github.com·2d·

Discuss: Hacker News

A masterclass in AI security operations

redcanary.com·4h

Recursive self-improvement from AI models

marginalrevolution.com·1d·

Discuss: Hacker News

Loading more...